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Convolutional Codes with Maximum Column Sum 
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Abstract —The column Hamming distance of a convolutional 
code determines the error correction capability when streaming 
over a class of packet erasure channels. We Introduce a metric 
known as the column sum rank, that parallels column Hamming 
distance when streaming over a network with link failures. 
We prove rank analogues of several known column Hamming 
distance properties and introduce a new family of convolutional 
codes that maximize the column sum rank up to the code 
memory. Our construction Involves finding a class of super¬ 
regular matrices that preserve this property after multiplication 
with non-singular block diagonal matrices in the ground field. 

Index Terms —Column distance, maximum rank distance 
(MRD) codes, network coding, super-regular matrices, 
maximum-distance profile (MDP) codes. 


I. Introduction 

In streaming communication, source packets arrive sequen¬ 
tially at the transmitter and are only useful for playback 
by the receiver in the same order. Erased packets must be 
recovered within a given maximum delay or be considered 
permanently lost. Streaming codes recover packets within 
these decoding deadlines and have previously been studied 
for single-link communication El-El. The works referenced 
in Ei-a focused primarily on low-delay recovery against 
burst losses, which are the predominant erasure patterns in 
Internet streams @. Alternatively, 0, Q considered coding 
for channels with arbitrary erasure patterns, restricting only the 
number of erasures in a window. It was shown that the column 
Hamming distance determines the maximum tolerable number 
of erasures that can occur in any window of the stream for 
decoding to remain successful. If there are fewer erasures than 
the distance in every sliding window, each source symbol is 
recovered within a given delay. A family of memory m convo¬ 
lutional codes, known as m-Maximum Distance Separable {m- 
MDS) codes, attain the maximum column Hamming distance 
up to the code memory. Furthermore, these are constructed 
from block Toeplitz super-regular matrices El, El-lII- These 
codes can also be used as constituent codes in the construction 
of optimal burst error correction codes for streaming systems 

El. 

Suppose that a transmitter sends packets to several users 
through a series of intermediate nodes. Using generation-based 
linear network codes, the problem of decoding is reduced to 
inverting the channel transfer matrix between the transmitted 
and received packets cni-iiia. If links in the network fail 
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to transmit packets for a given time instance, the rank of the 
channel matrix decreases, making packet recovery infeasible. 
One solution is to use end-to-end schemes to precode chan¬ 
nel packets before transmission. Rank metric codes such as 
Gabidulin codes Ol, M are capable of protecting packets in 
rank-dehcient channels. The minimum rank distance of a code 
determines the maximum permissible rank loss in a channel 
matrix. This method can be considered for single-shot network 
coding EH, EH or for multishot extensions Ell, El- 

In this work we study a streaming setup that extends the 
single-link model of El-El to a network. We assume that 
the intermediate nodes implement linear network coding and 
produce a channel transfer matrix relating the transmitted and 
received packets. To combat link failures, the source stream 
is further precoded at the source using a streaming code. We 
dehne a new metric, the column sum rank and introduce a new 
family of convolutional codes that attain the maximum value 
for this metric. These are rank metric analogues of m-MDS 
codes El- Just as the column Hamming distance determines 
the maximum allowable number of erasures in single link 
streaming, we show that the column sum rank determines the 
maximum rank deficiency of the channel. Interestingly, there 
has been little prior work on rank metric convolutional codes. 
To our knowledge, the only previously studied construction 
appears in ED, where the authors consider the active column 
sum rank as the metric of importance. Consequently, their 
approach and results differ from the present work both in the 
code constructions and applications. 

This paper is outlined as follows. The network streaming 
problem is introduced in Section |II] We provide an overview 
of rank metric block codes and m-MDS codes in Section 
Uni The column sum rank is dehned in Section IIVI where 
we derive several properties and establish their relevance in 
network streaming. Codes that maximize the column sum 
rank are referred to as Maximum Sum Rank (MSR) codes. 
We introduce a class of super-regular matrices in Section [V] 
that preserve super-regularity after multiplication with block 
diagonal matrices in the ground held and use these to construct 
an MSR code in Section |Vll We conclude this paper with code 
examples and a discussion on the necessary held size. 

H. Network Streaming Problem 

The streaming problem is dehned in three steps; the encoder, 
network model, and decoder. Encoding is performed in a 
causal fashion, as the incoming packets are not known until 
the time at which they must be transmitted. A linear network 
code has been applied to the network and each node receives 
and sends linear combinations of the symbols in the channel 
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packet. Consequently the network is abstracted to a channel 
matrix, which is assumed to be known to the receiver m. 
The decoder observes linear combinations of the symbols in 
each transmitted packet and must recover the source within 
the imposed deadline. 

A. Encoder 

Let q be a prime power and M > 0. At each time instance 
f > 0, a source packet Sj G arrives at the transmitter 
node. A channel packet Xt G F^m is constructed via a causal 
function of the previous source packets 7 t(so,..., St). We 
consider the class of linear time invariant encoders. A rate 
i? = L encoder with memory m generates the channel packet 

m 

^ ^ — i ' Gri; (1) 

using a set of generator matrices G, G for 0 < i < m. 

B. Network Model 

The transmitter node sends one channel packet through the 
network at each time instance. Although there is a natural 
delay in the end-to-end transmission due to link delays, we 
assume that such delays are sufficiently small so that one time 
instance contains the encoding, transmission, and decoding of 
a single channel packeQ. The transmission of a single channel 
packet over one time instance is referred to as a shot. In 
each shot, the destination node observes yt = XjAj, where 
At G is the channel matrix at time t, and is known 

to the receiver. In practice, the coefficients for the linear 
transformations applied by each node can be encoded into 
header bits, which the receiver uses to reconstruct the channel 
matrix El. 

Each shot is independent of all others. Communication over 
a window \t,t + W — 1] of kL shots is described using 
y[t,t+w-i] = X[t_t+w-i]A[t,t+iy_i], where A[t_t+VF-i] = 
diag (At,..., At+w-i) is a block diagonal channel matrix 

El, El- Let pt=rank(At) denote the rank of At, for all 
t > 0. The sum of the ranks of the individual channel matrices 
is equal to the rank of the channel matrix in the window, 
i.6., Pi = rank (A[t,t-i-w-i]). Suppose that at any 

time instance, a link in the network may fail to transmit its 
intended symbol. Intermediate nodes that do not receive an 
intended symbol simply do not include that symbol in the 
linear combination they transmit. If all links are functional in 
the shot at any time t, then pt = n, but failing links may result 
in a rank-deficient channel matrix at that time. One failing link 
can eliminate at most one of the min-cut paths connecting the 
transmitter and receiver. It follows that rank(Ai) is reduced 
by at most 1 for every failing link ll20l . We introduce a sliding 
window model to characterize rank deficiencies in the network. 

Definition 1. Consider a network where for all f > 0, 
the receiver observes y* = xjAt, with pt=rank(At). The 

* For example in audio streaming, coded speech packets are generated once 
every 20 ms. When the propagation delays are much smaller than this value, 
they can be ignored. 


Rank-Deficient Sliding Window Network C'H{S, W) has the 

property that in any sliding window of length W, the rank of 
the block diagonal channel decreases by no more than S, i.e.. 
Pi ^ ~ ^ for each t > 0. 

In analysis, we disregard the linearly dependent columns 
of the channel matrix and the associated received symbols. 
At each time instance, the receiver effectively observes y^ = 
XjA*, where A* G contains only the linearly indepen¬ 

dent columns of At and is referred to as the reduced channel 
matrix. 

Remark 1. A sliding window model has been used in prior 
works on delay-constrained coding over single-link channels 
a, M- In these works, the channel is adversarially permitted 
to erase symbols in each channel packet up to a maximum 
number of erasures within each sliding window. The Rank- 
Deficient Sliding Window Network can be viewed as an 
extension of this model, where the channel introduces rank 
deficiencies rather than erasures. 

C. Decoder 

Let T be the maximum delay permissible by the receiver 
node. A packet received at time t must be recovered by 
time t + T using a delay-constrained decoder, i.e., St = 
T]t{yo, ■ ■ ■ ,yt+T) is the reconstructed packet. If the decoded 
source packet s* is equal to Sj, then the source packet is 
perfectly recovered by the deadline; otherwise, it is declared 
lost. A linear code C over F^m is defined as feasible for 
C'H{S, W) if the encoding and decoding functions for the 
code are capable of perfectly recovering every source packet 
transmitted over the channel with delay T. 

In this paper we will assume that the window length satisfies 
W = T -\-1. A source packet St must be decoded by time 
f -f T at the receiver. Thus its active duration spans the interval 
\t,t -f T], which maps to a window length of W = T + 1. 
Nevertheless, we will also discuss how our codes can handle 
the case when W f T + f. 

Our objective in this paper is to construct codes that 
guarantee recoverability under the worst channel conditions 
for a fixed delay and rate, i.e., identifying the largest rank 
deficiency S for which a code with a given rate is feasible. 
Towards this end, we introduce a new metric called the column 
sum rank distance of a convolutional code. We show that it 
is both a necessary and sufficient metric to determine the 
maximum rank deficiency from which the code guarantees 
perfect recovery. Thus, maximizing S reduces to finding codes 
with maximum column sum rank distance over the interval 
[0,T]. 

Codes which achieve the maximum column sum rank 
distance will be referred to as Maximum Sum Rank (MSR) 
codes in this paper. Furthermore the column sum rank distance 
possess a profile property. Achieving the maximum distance at 
one point implies that it is also maximized at all points before 
it. Operationally we show that this property guarantees that the 
product of the generator matrix with elements of a specific set 
of channel matrices is full-rank. Finally we propose a family of 
super-regular matrices that permit this multiplication property 
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of the generator matrix, which we then use to construct MSR 
codes. Our proposed family of codes uses the properties of 
rank metric block codes and m-MDS convolutional codes, 
which are introduced as preliminaries in the following section. 


III. Background 
A. Rank Metric Codes 

Consider a vector x £ F”m over the extension field. We 
refer to x as a channel packet. The vector x over the extension 
field is isomorphic to an n x M matrix over the ground field 
Fq. Formally, a bijective mapping : F^m jpnxM allows 
for the conversion of this vector to a matrix over the ground 
field. We more thoroughly detail this mapping in Appendix lAl 
Noting that a normal basis can describe every element in the 
extension field, the elements of F^m are mapped to linearized 
polynomials evaluated at a normal element, whose coefficients 
form column vectors. A row vector in F”m then maps to a 
matrix whose columns are the coefficients of the corresponding 
linearized polynomials. 

The rank of x is defined as the rank of its associated matrix 
(/)„(x). The rank distance between any two vectors x, x £ F^m 
is defined as 


dii(x,x) = rank((/)„(x) -(/)„(x)). 


The rank distance is a metric and is upper bounded by the 
Hamming distance . For any linear block code C[n, k] over 
F^m, the minimum rank distance is defined as the smallest 
rank amongst all non-zero channel packets. Similar to the 
minimum Hamming distance, the minimum rank distance of 
a code must satisfy a Singleton-like bound, i.e., dnlC) < 
min — fc)-|-l 1131 . We simplify the notation when 

C is obvious. It is assumed that M > n from here on; dji 
is then bounded exactly by the classic Singleton bound. Any 
code that meets this bound with equality is referred to as a 
Maximum Rank Distance (MRD) code. Such codes possess 
the following property. 

Theorem 1 (Gabidulin, ifTsl ). Let G £ F^m" be the generator 
matrix of an MRD code. The product of & with any full-rank 
matrix A £ F^^*^ satisfies rankGA = k. 

A complementary theorem was proven in ifTSl for the parity- 
check matrix of an MRD code. We use the equivalent generator 
matrix property, which arises from the fact that the dual of an 
MRD code is also an MRD code 1131 . 

Gabidulin codes are an important family of MRD codes. To 
construct a Gabidulin code, let po,..., gn-i € F^m be a set of 
elements that are linearly independent over F^. The generator 
matrix for a Gabidulin code C\n,k] is given by 


G 


/ 50 

5 ™ 


5i 


si*-" 


■ • ■ gn -1 \ 

Hn-l 


[fe-1] / 

in-l ' 


where we use the notation to denote the j-th Frobe- 

nius power of 5 £ F^m (see Appendix lAll. Gabidulin codes 
can be applied directly as end-to-end codes over networks 


where a channel matrix A £ F"^" transforms symbols of 
the transmitted channel packet ifTSl . OH. For a source packet 
s £ F^m encoded by a Gabidulin code, a receiver observes 
y = sGA. By Theorem [T] the product GA is an invertible 
matrix as long as rank A > k. 


B. The Column Hamming Distance 

Let C[n, k, m] be a linear time-invariant convolutional code, 
where m is the code memory. For a source packet sequence 
S[Q ^] = (sq, ..., Sj) £ the channel packet sequenc^l 

X[oj] = S[o_j]G®^ is determined using the extended form 
generator matrix 


/Go 


^EX ^ 


Gi 

Go 


G, 

G,-i 




V Go; 


( 2 ) 


where Gj £ F*^" and Gj = 0 for j > m ll22l Chapter 1]. 
We assume from here on that Go always has full row rank. 
This guarantees that G|"^ also possesses full row rank, which 
is a property used in subsequent results. 

The Hamming weight of X[o is a sum of the Hamming 
weight of each channel packet xt for 0 < t < j. The j-th 
column Hamming distance of a convolutional code is defined 


dH{j)= min wtff(x[oj]), 

X[o,3]eC.so#0 


as the minimum Hamming weight amongst all channel packet 
sequences for which the initial source packet Sq is non¬ 
zero Q, ES. Note that because Go is full-rank, So 7 ^ 0 
immediately implies that Xo 7 ^ 0 as well. 

Several properties pertaining to the column Hamming dis¬ 
tance were treated in 0,121. We summarize two relevant ones 
below. In Section |IV] we prove analogous properties to these 
for the rank metric. 


Property 1 (Tomas et al., 0). Consider an erasure channel 
being used for each t > 0, where the prior source sequence 
S[o,t_i] is known to the decoder by time t + j. If there are at 
most dnij) — 1 symbol erasures in the window [t, t + j], then 
St is recoverable by time t -I- j. Conversely, there is at least 
one hypothetical channel window [t,t + j] containing dnij) 
erasures for which St is not recoverable by time t + j. 

Property [T] states that for sliding window erasure channels 
featuring windows of length W, a convolutional code with 
column Hamming distance dniW — 1) can guarantee perfect 
decoding with delay W — 1, provided that there are less than 
dniW — 1 ) erasures in the window 0 . 

Property 2 (Gluessing-Luerssen et al., O). The j-th column 
Hamming distance of a code is upper bounded by a Singleton¬ 
like bound, i.e., dnij) < (tt—fc )0 + l) + l- If meets this 

bound with equality, then dnii) meets its respective bound for 
all i < j. 


^In network coding literature, each xt G referred to as a generation 

of n channel packets. We denote xt as a channel packet containing n symbols 
and X[j as a sequence of j packets. Similar notation is used for the 
source. 
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This property asserts a convolutional code extension of the 
Singleton bound. The desirability of achieving large column 
Hamming distances is given in the previous Property □ In 
conjunction with this, a code capable of recovering from 
d-Hij) — 1 erasures with delay j can be shown to be further 
capable of the respective maximum recovery for all i < j. 

There exist several families of codes which achieve the 
maximum dnij) for some given j I?), || 8 |, ll24l . One such 
class of codes are m-MDS codes. These achieve the upper 
bound up to the code memory i.e., dnU) = {n — k){j + l) + l 
for 0 < j < m. The MSR codes, which we introduce in 
this work, are rank metric analogues of m-MDS codes. One 
approach to constructing the generator matrix of m-MDS 
codes is by taking a sub-matrix of fc(m -f 1 ) rows from a 
block Toeplitz super-regular matrix ||7]. A prior construction 
of a block Toeplitz super-regular matrix was given in Q- As 
this construction is modified for our purposes, we include a 
summary of this construction in Section |V] as well as a review 
of super-regular matrices in Appendix 

C. The Active Column Sum Rank Distance 

Let C[n, fc, m] be a linear time-invariant convolutional code 
over ¥qM, whose codewords are generated using (El. The 
active column sum rank distance is a metric for convolutional 
codes that was proposed in a prior work lIT^ . This metric is 
defined using the state Trellis graph of the convolutional code. 
Let C“ be the set of all channel packet sequences X[o that 
are constructed by exiting the zero-state of the Trellis at time 
0 and not re-entering it for 1 < t < j — The j-th active 
column sum rank of a linear convolutional code C[n, k, m] is 
then defined as as the minimum sum rank of all channel packet 
sequences in C“, i.e., 

dnij) - min „ 

where </)„(•) is the previously introduced mapping from vectors 
in the extension field to matrices in the ground field in 
Section IIII-AI 

Note that by restricting itself to only consider channel 
packet sequences in C“, the active column sum rank of a 
convolutional code does not impose any guarantees on the sum 
rank of any channel packets that enter the zero state before 
time j. An example of such a sequence is provided in Fig. [T| 
Consequently, the active column sum rank is not a sufficient 
metric to guarantee delay-constrained decoding over a sliding 
window channel, as we show in the next section. 

IV. The Column Sum Rank Distance 

Let C[n,k,m] be a linear time-invariant convolutional code 
over F^m , constructed in the same manner as in the previous 
section. We introduce the j-th column sum rank distance of a 
code 

j 

dnij) = min ^ rank ((/)„(xt)), 

X[o,3]eC.S0540^ 


as an analogue of the column Hamming distance. Unlike the 
j-th active column sum rank distance, this metric permits 
returning to the zero-state before time j. For example, the 
channel packet sequence generated in Fig. [T] is valid for 
the column sum rank distance. As a result, this metric is 
stronger than the active version, i.e., dnij) < d%{j). In the 
following theorem, we show that the column sum rank distance 
of a convolutional code is both necessary and sufficient to 
guarantee low-delay decoding over C'H{S, W). 

Theorem 2. Let C[n,k,m] be a convolutional code used over 
the window [0, W — 1]. For 0 < f < kU — 1, let Aj € 
be full-rank matrices and w-i] ~ (-^Oi ■ ■ ■ ^ ^w-i) 

be a channel matrix. The following statements are true: 

1) If dii{W — 1) > nW — ’^hen Sq is always 

recoverable by time W — 1. 

2) If dii{W — 1) < nW — '^hen there exists at 

least one channel packet sequence and channel matrix for 
which So is not recoverable by time W — 1. 

Proof: Due to code linearity, we only need to show that 
all output channel packet sequences are distinguishable from 
the all-zero sequence. We prove this by contradiction. Consider 
a source packet sequence S[o^n/-i] = (sq, • ■ • ,sw-i), where 
So ^ 0. Suppose that this sequence generates the channel 
packet sequence X[o_h^_i], for which X[o_vv_i]A^p = 0 . 
This implies that rank ((/)„(xt)) < n —rank(A() for 0 <t < 
W — 1. By summing each of the inequalities, we arrive at the 
following contradiction on the sum rank of the channel packet 
sequence: 

w-i w-i 

rank((j„(xt)) < nW - pt 

t—0 t—0 

<dn{W-l). 

For the converse, let S[o,iy_i] = (sq, ..., sw-i), with 
So ^ 0 be a source packet sequence that maps to X[o_vv-i]> 
for which rank((/)„(xt)) = daiW — 1). For 0 < f < 

VF — 1, let Pt = n — rank (j)„(xt)). There exist matrices 
A* g ^nxpt xtAt = 0. We let A*p = 

diag (Ag,..., be the channel matrix. Summing all 

of the Pt reveals rank(Ap = nW — d[i{W — 1). 

Furthermore, S[o,vv_i] is indistinguishable from the all-zero 
source packet sequence over this channel. ■ 

Remark 2. The constraint that so ^ 0 is necessary in 
order to differentiate the first source packet from the all¬ 
zero source packet. Note however, that there is no necessary 
constraint on the state Trellis transitions; any non-zero source 
packet sequence should be differentiable from the all-zero 
sequence. Using the active column sum rank in place of the 
column sum rank in the above Theorem then leads to a partial 
guarantee. If X[o^n/-i] G then the active column sum 

rank determines the maximum rank deficiency in the channel 
from which the first source packet is recoverable. However, 
if X[o_vv_i] then the W — 1-th active column 

sum rank does not provide any guarantees on recoverability. 
Consequently, we view the active column sum rank as an over¬ 
estimate of the recovery capability of the code. 
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Fig. 1; A hypothetical transition along the state Trellis graph is highlighted. The active column sum rank distance (i^(5) does 
not guarantee the sum rank of the channel packet sequence generated by this path, i.e., X[o_ 5 ] ^ Cf. In this work, we introduce 
the column sum rank, which does consider this channel packet sequence. 


For time-invariant encoders. Theorem |2] can be used to 
guarantee that all source packets are recovered with delay 
at most W — 1 over a sliding window channel. Assuming 
all prior packets have been decoded, we recover each St 
using the window [t^t + W — 1]. The contributions of S[o_t_i] 
can be negated from the received packet sequence used for 
decoding. Theorem|2]is then effectively a rank metric analogue 
to Property [T] from Section ITlI-BI which describes how column 
Hamming distance bounds the number of tolerable erasures in 
single-link streaming Q. 

Remark 3. Aside from rank-deficient channel matrices, adver¬ 
sarial errors can also be considered using rank metric codes. 
Consider a single-link single-shot system where the receiver 
observes y = x -f e, with e G F^m being an additive error 
vector. If rank(0„(e)) < the decoder for an MRD 

code can recover the source II3. MRD codes reflect a rank 
analogue of the error correcting capability of MDS codes. It 
can easily be shown that the column sum rank can ensure xq 
is recoverable by time j in the channel y[oj] = X[oj] -l-e[o,j], 
if the sum rank of ejQ is constrained to be at most 

. The proof follows similarly to that for Theorem |2] 

We next propose an analogue to Property |2] from Section 
Erl First, we bound the column sum rank by dR{j) < 
{n — k){j + 1) + 1. The sum rank of a channel packet cannot 
exceed its Hamming weight, meaning that the upper bound 
on column Hamming distance is inherited by the rank metric 
analogue. Furthermore, we show that if the j-th column sum 
rank achieves its upper bound, all prior column sum ranks do 
so as well for their respective bounds. 

Lemma 1. If dni^j) = {n — k){j -I- 1) -f 1, then dR{i) = 
{n — k){i + 1) + 1 for all i < j. 

Proof: It suffices to prove for i = j — 1. Let C[n, k, m] be 
a code for which dii{j — 1 ) does not attain the upper bound 
i.e., dnij - 1 ) < {n-k)j, but dfi{j) achieves the maximum 
i.e., dR{j) = (n — fc)(j-|-l)-|-l. We will argue by contradiction 
that such a code cannot exist. 

Consider a source packet sequence S[oj_i] that gener¬ 
ates X[Q whose sum rank equal to dii{j — 1) i.e.. 


LLo= dnij - 1) holds. We argue that 
this sequence can be augmented to include x^, such that 
^^^orank(0„(xt)) < (n - k)(j + 1) < dij(j) holds. This 
will complete the contradiction. 

To exhibit such a choice of xj recall that Xj = 
+ SjGo. The summation up to j — 1 produces 
a vector whose Hamming weight is at most n. Because 
rank (Go) = k, the source packet Sj can be selected specif¬ 
ically in order to negate up to k non-zero entries of the 
first summation. This implies that wtn (xj) < n — k and 
consequently, rank (())„(xj)) < n — k. Therefore, we bound 
the sum rank of X[o_j] 

3 

y^rank((/)„(xt)) = d_R(j - 1 ) -f rank ((/)„(xj)) 

i =0 

< dnij -l)+n-k 
<{n- k){j + 1 ), 


as required. ■ 

Codes achieving the Singleton bound for drj(m) are referred 
to as MSR codes. They directly parallel m-MDS codes, which 
maximize the m-th column Hamming distance 121 . In fact, 
since dR{j) < dnij), MSR codes automatically maximize 
the column Hamming distance and can be seen as a special 
case of m-MDS codes. 

By Theorem |2] an MSR code with memory T = W — 1 
recovers source packets with delay T, when the rank of the 
channel matrix is at least k{T -f 1) in each sliding window 
of length W. We prove the existence of MSR codes in the 
next section, but first discuss a matrix multiplication property 
for the generator matrix. The following theorem serves as an 
extension of Theorem [T| to convolutional codes transmitted 
over independent network uses. 


Theorem 3. For 0 < f < j, let 0 < p* < n satisfy 

t 

^Pi<k{t-\-l) (3) 

for all t < J and with equality for t = j. The following are 
equivalent for any convolutional code: 

1 ) dnij) = {n-k){j -f 1 ) -f 1 
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2) For all full-rank = diag ( Aq ,..., A*) constructed 
from full-rank blocks A^ S and pt that satisfy 

the product G®^A*p is non-singular. 

Proof: We first prove 1 => 2. Suppose there exists 
an A*p whose blocks satisfy O}, for which G^^A^q 
is singular. Then there exists a channel packet sequence 
X[Q_j], where X[Q j]A*p^.j = 0. We show that this leads to a 
contradiction of 1. The contradiction is immediate if Xg 0. 
In this case the sum rank of X[oj] is at least dfi{j), i.e., 
((/)„(xt)) > must hold. Note however that: 

3 3 

^rank {(t>n{M)) < n{j + l)-'^pt 

t^O t^o 

= {n-k){j + l) 

contradicts dii{j) = (n — k){j -f 1) -f 1. Note that we use Q 
with equality at t = j in the second step. 

If Xq = 0, then the sum rank of X[oj] is not constrained 
by d]i{j). Let I = argmin^ xj ^ 0 be the smallest index for 
which X; is non-zero and consider the channel packet sequence 
X[;_j], whose sum rank is at least dii{j — l). Because Xt A* = 0 
for t = I,... ,j, we bound rank ((/)„(xi)) < n — p* in this 
window. The sum rank of X[i is bounded: 

3 3 

^rank((/)„(xt)) < n{j - I + 1) - '^pt 
t — l t — l 

<{n- k){j -l + l). 

The second line follows from J2t=i Pt — ^ + 1)’ which 

can be derived when Q is met with equality for t = j. Due 
to Lemma [H the column sum rank achieves dn^j — 1) = (n — 

k) {j — / +1) -b 1. The sum rank of X[; is less than d[i{j — 1), 
which is a contradiction. 

We prove 2 1 by using a code with dii{j) < (n — k){j -b 

l) and constructing a full-rank A^g for which Gj'^A^g is 

singular. Let m = argmin^ (irj(i) < {n — k){i + l) be the first 
instance where the column sum rank fails to attain its upper 
bound and consider the sequence X[q with the minimum 
column sum rank. We show that there exist full-rank matrices 
A* satisfying both ([^i and xt A* = 0 for 0 < t < m. 

In addition, we aim to have equality in Q at f = m and thus 
A*g will be of dimension (m -b l)n x {m + l)k. This is 
relevant later in the proof. 

When 777. — 0, the column rank of Xq cannot exceed ti k. 
For every po < n — rank ((/)„(xo)), there exists an Aq for 
which xqAq = 0. Clearly we can always choose an Aq with 
rank pQ = k. 

When 777 > 0, the sum rank of X[q q satisfies 

t 

rank((/)n(xi)) > (n - k){t + 1) + 1 (4) 

for 0 < f < 777 — 1, and 

m 

rank((/>n(x^)) < (n - k){m + 1). (5) 

Let Pt = 77 —rank ((/)„(xt)) for 0 < f < 777 — 1 and choose the 


appropriate A* for which xj A^ = 0. For all 0 < f < 777 — 1, 
we have that: 

t t 

= 77(f-b 1) - ^rank((/)„(xi)) (6) 

z—0 z—0 

<fc(f-bl)-l, (7) 

confirming that Q is satisfied for t < m—1. Note that in (|7]i 
we apply the inequality in (|4]i. We next specify an appropriate 
choice for pm- We will select: 

771—1 

Pm = {m-\- l)k - Pt , (8) 

t^o 

and show that there exists an associated A^ G jp^xpm 
will satisfy XmAJjj = 0. It thus suffices to show that pm also 
satisfies pm < n — rank((/)„(xm)). Note that 

m — 1 

n - Pm = n - {m l)k Pi (9) 

z^O 

m—1 

= (n - k){m + 1) - rank(0n(xi)) (10) 

z^O 

m 

= (t7 - k){m + 1) - ^ i'ank((/)„(xi)) 
i=0 

-b rank((/)„(xm)) (11) 

> rank((/)„(x™)), (12) 

where (doll follows by using (ISll with f = 777 — 1 and dnii 
follows via the inequality in (|5]l. Finally, our choice (l8]l also 
guarantees that A^q has dimension (777 -b 1)77 x (777 -b l)fc 
as claimed. 


The remaining ..., A* can be any full-rank n x k 

matrices, thus satisfying (O for all t < j. The product 
G^^^A^q can be written as 


^3 


A 


[0.7] - 







1 ^ 

* 

[0,m] 

X A* 

^^[m+l,i] 


VA* 

^^[m+lj] 



where X and Y denote the remaining blocks that comprise 
G|"^. The block GJ^^A*q is a square matrix with a zero 
determinant. Therefore, det G|"^A^p is also zero. ■ 

Although in this work, we do not propose a general de¬ 
coding algorithm for MSR codes, we remark that decoding 
in the network streaming problem can be reduced to matrix 
inversion. Consider a scenario where an MSR code with 
memory T = FF — 1 is used and that all source packets before 
time t have been recovered. To ensure that St is recoverable 
within its deadline of time t + T, we consider increasingly 
larger windows \t,t + j] for 0 < j < T. Theorem |2] states 
that if Pi ^ k{t + j + 1), then the decoder cannot 

guarantee recovery by time t + j. The window length j 
is incremented at each time instance up to the first point 
where Pi > k{t + j + f) is achieved. By Theorem 

12 the rank conditions in ([2 are satisfied and G^^AJj 
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is non-singular. We therefore invert this matrix and solve for 
S[t recovering all packets in the window simultaneously. 
Consequently, packets encoded by an MSR code can be 
recovered for the network streaming problem with complexity 
0((jfc)3). 

In the next section, we construct an extended generator 
matrix. Theorem [3] is then useful afterwards to verify that the 
generator matrix does in fact define an MSR code. 

V. Preservation of Super-regularity 

In Q, the authors provided a construction for a Toeplitz 
super-regular matrix that exists in F, for a prime q. From 
©, the generator matrix of a convolutional code is block 
Toeplitz and we focus on super-regular matrices with similar 
structure. For simplicity, we consider block Hankel matrices 
and later convert the structure to block Toeplitz before code 
construction. 


A. Block Hankel Super-regular Matrices 

A block Hankel super-regular matrix construction, which 
we outline below, was proposed in ||9] for F^m where q is a 
prime power and M is sufficiently large. 


Theorem 4 (Almeida et al., 191). For n,m G N, let M = 
q"’("*+2)-i Let Q, g be a primitive element and a root 
of the minimal polynomial pdX). For 0 < j < m, let the 
be defined by 


blocks Tj G F|^ 


/ 


T = 

-*■7 


alM] 

q,["7 + 1] 


q,["7 + 1] 
q ,[" 7 + 2 ] 


q,["(7 + 1)] 


yQ,["(7-l-l)-l] q,["(7 + 1)] 

Then the following block Hankel matrix 


T = 


/ 


VTo 


^[nU+ 2 )- 2 ] y 


To\ 

Ti 


Tm-l TmJ 


is super-regular. 


(13) 


(14) 


We refer the reader to 0 for the full proof. In this 
subsection however, several properties of T that lead to its 
super-regularity are highlighted. They will be useful in the 
subsequent section. 

Let Tj{r, s) be the element in the r-th row and s-th column 
of Tj, where 0<r, s<n — 1 and 0 < j < m. Each entry 
of the matrix is a linearized monomial evaluated at X = a, 
in particular 0, 


Tjds) = 


(15) 


The g-degrees of these monomials increase as one moves 
further down and to the right inside any T^. This can be 


^We use the variable X when discussing properties of linearized polyno¬ 
mials and evaluate the polynomials at X = a specifically when calculating 
the determinant of a matrix. However, we suppress the dependence on X 
whenever it can be determined from the context. 


described with the following inequalities: 

degqr,(r',s) > degqTj(r, s), 0<r<r'<n — l, (16a) 

degqrj(r, s') > degq Tj(r,s), 0<s<s'<n — 1, (16b) 

for all 0 < j < m. 

Combining all of the Tj to construct T in (fl4li preserves 
the property of q-degrees strictly increasing downwards (along 
a single column) or rightwards (along a single row) now for 
the block Hankel matrix, i.e., 

degqTj,{r', s) > degqTj(r, s), 0 < j < / < m, (17a) 

degq Tj/(r, s') > degqTj(r, s), 0 < j < / < m, (17b) 

for all 0 < r, r' < n — 1 and 0 < s, s' < n — 1. 

To show that T is super-regular, let D be any I x I sub-matrix 
of T. All sub-matrices of T preserve the degree inequalities 
in (fThl l and (fTTl i. T being super-regular is equivalent to 
det D 0 for every D with a non-trivial determinant. 

The determinant is evaluated using the Leibniz formula as a 
polynomial D{a) — sgn((T)i7£r. Each non-zero term 

in the formula, is a product of linearized monomials 

i-i 

Da = ]qL>(i,cr(i)). 
i=0 

Note that the determinant polynomial itself is not linearized. 
Using (fT6l l and ( fTTl i. the degree of this polynomial can be 
bounded. 

Lemma 2 (Almeida et al., 0). Eor T defined in (O, let 
D be any square sub-matrix with a non-trivial determinant 
(see Appendix I A-Bk Let D{X) be the polynomial, such that 
D{a) = detD. The degree of D{X) is bounded by 

1 < degD{X) < 

In 0, the matrices defined by (fOl l contained entries with 
Erobenius powers with fixed q = 2, rather than for a general 
q. A direct extension reveals that the bounds in Lemma |2] still 
holds when using Erobenius powers with arbitrary q in (foT l 
provided that M > g"(™+2)-i Wg refer the reader to 0, 
ll25l for the full proof. 

An arbitrary q and appropriate choice of a permits the 
super-regular blocks Tj to resemble Gabidulin code gener¬ 
ator matrices. We require this slight generalization since our 
channel in Section El can operate over any field F^. 

The upper bound gives degi9(X) < degpti(X), which 
implies that, Pa{X) \ D{X). Consequently, a cannot be a 
root of D{X) (see Lemma|4]of Appendix I aTi. 

The lower bound in our extension does not change from the 
original. It implies that D{X) is not the zero polynomial and 
can be derived by algorithmically finding a unique permutation 
d = argmax^ deg Da{X), which generates the highest degree 
monomial term in the Leibniz formula. Because tr is unique, 
there is no other term which can negate this monomial. Then, 
degD{X) = deg Da{X), meaning D{X) is not the zero 
polynomial. 

The upper and lower bounds cover both potential cases 
to ensure D{q) ^ 0. Therefore, any D with a non-trivial 
determinant is also non-singular and consequently, T is a 








super-regular matrix. 

A block Toeplitz or Hankel super-regular matrix can be 
used to construct the extended generator matrix of an m- 
MDS code jSl. We will use a similar technique to construct 
MSR codes. Super-regularity alone however, is insufficient 
for the network streaming problem. We show that the super¬ 
regular matrix construction technique above can be modified 
to produce the desired property for the network streaming 
problem. 


B. Preservation of Super-regularity after Multiplication 

In this section, we connect the matrix multiplication prop¬ 
erty in Theorem|3]to super-regularity. The block Hankel matrix 
T from (fl4l) is constructed using an a that is both a primitive 
and a normal element of the field F^m . We show in this case 
that the super-regularity of T is preserved after multiplication 
with any block diagonal matrix in the ground field. 

Theorem 5. For 0 < t < m, let A* € be 

any non-singular matrices. We then construct A[o_m] = 
diag (Aq, ..., Am). Let T S ]p™b"+i)x’T’(™+i) supgj-. 

regular matrix in (fl4l l. If M > g"(™+2)-i ^nd a is a primitive 
normal element of F^m, then F = TA[o,m] is super-regular. 

Several properties of F are introduced before proving The- 
orem|5] To simplify the notation, we assume that Aq = Ai = 
• • • = Am = A. This allows free use of the polynomial 
structures and bounds from the previous subsection. We will 
then explain why the proof extends immediately to the general 
case. Hence, A[o,m] = diag (A,..., A), where A e F""""- is 
a non-singular matrix in the ground field and we consider 
properties of the product 

F = TA[o,m]- 

Property 3. The product F = TA[q m] is a block Hankel 
matrix, whose blocks each possess a Moore structure. 


The product can be written as 


F = TA 


[0,m] 


VToA 


ToA\ 

ToA TiA 

Tm-iA TmAy 


Let Fj = TjA for j = 0,... ,m. To show that Fj has a 
Moore structure, let (/g,..., /n-i) be the first row of Fg, 


i.e.. 


(/o,...,/„_i) = (a[°l,...,aI”-d)A. 


Each element f is a linearized polynomial fi{X) evaluated 
at A = a, with coefficients from A = [A; i], i.e.. 


/, = (18) 
1=0 

Since a € F^m and A; ^ G F^, invoking Freshman’s rule 

/,(aW) = /W(x), s>0, (19) 


results in the following Moore structure. 


F ■ — 


/ flM] 
Jo 

Jo 


/] 




V/o 

for 0 < j < m. 


/: 


clnj + l] 


[n(j + l)-l] 


AM] \ 
Jn-l 
AM-f-1] 
Jn-l 


AnU+A-M 
J n— 1 , 


( 20 ) 


Remark 4. Since a is normal over F^m and A is full rank, 
(/o,..., fn-i) are linearly independent over Fg (see ifOl l. 


Property 4. The g-degrees of the polynomial elements of Fj 
strictly increase downwards on any fixed column. They are 
not necessarily monotonic across a fixed row. 

From (fTsl l and (fT^ . the g-degree of each linearized poly¬ 
nomial is bounded by 

s < degq/]'''(A) < n - 1-b s (21) 

for 0 < s < n{m -I- 1) — 1. Note that both the upper and 
lower bounds in (1211 1 do not depend on the column index i. 
Furthermore all elements on any fixed row in Fj share the 
same degree (c.f. (l20l i). Consequently, the polynomial entries 
on any fixed row of the block Fj share the same bound and 
are not necessarily in an increasing order. Thus we do not 
have a counterpart of (llbbl i that guarantees that the q-degrees 
of elements are monotonically increasing across each row in 

T,- 

The counterpart of (IIbal l is satisfied however. Let Fj{r,s) 
be the element in r-th row and s-th column of Fj, i.e., 

F,ir,s) = fl^^+^\ ( 22 ) 


where 0<r, s<n—1 and 0 < j < m. The Moore structure 
of Fj implies that the g-degrees of the polynomial entries 
strictly increase from top to bottom along any fixed column 
of Fj, i.e., 

degqFj (r',s) > degqFj (r, s), 0 < r < r'< n - 1, (23) 


for all 0 < j < m. 

The following lemma establishes the relations between g- 
degrees of the polynomial entries inside F. 


Property 5. The following inequalities 

degq Fj, (/, s) > degq (r, s), 0 < g < j' < m, (24a) 

degq Fp (r, s') > degq Fj (r, s), 0 < g < j' < m, (24b) 

hold for all 0 < r, r' < n — 1 and 0 < s, s' < n — 1. 

Proof: The inequality in (I24al l is established by 
degq Fp (r' , s) - degq Fj (r, s) 

= degqdegq 

= (ng' -b r') -b degq /« - (nj + r) - degq /., 

> n -b r' — r 

> 0, (25) 

where the first inequality follows from g' > g -b 1 , and the 
second due to r' — r > —n. 
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Using (ISTT i. we prove ( I24bl ). 

degq Ff (r, s') - degq Fj (r, s) 

= degq/K+^l-degq/i"^+’'l 

> nj' + r — (n — 1 + nj + r) 

> 0. (26) 


Here, the first inequality follows from (1211 1. and the second 
due to f > j + 1. ■ 

We note that (l23T l. (I24al l and (I24bb parallel the inequali¬ 
ties (I16ab . (I17ab and (I17bb for entries of T. It remains to hnd 
an equivalent relationship of (I16bb for the entries of F. 

To state the next property we let ^nd 

R-j G be tbe f-th column and row blocks of F, 

i.e., 


F= (Co.Ci,--- ,C^) 


/Ro\ 

Ri 

VRm/ 


(27) 


where 






(O,--- ,0,Fo,Fi,--. ,F,). (28) 


0 

Fo 

Fi 

VfJ 

The authors in 13 showed the following property. 


Property 6. Let D be an Z x Z sub-matrix of F possessing a 
non-trivial determinant. The matrix D can be written as 


/ 

o,, 


\ 

o 

h-1 





D/i-i 

D„ 

Oo 




\ Do 



/ 


(29) 


where 0 < Zi < m. Each is a zero matrix drawn from a 
single row block of F, whereas each is a matrix containing 
non-zero polynomials drawn from a single column block of 
F. Let Zi e {1,..., n} be the number of columns in each Di 
for f e {0,..., h}. Then, 


Remark 5. In the special case of h = m, Di consists of 
columns drawn from the non-zero elements of Ci, whereas 
Oi consists of rows drawn from the zero elements of Rm-t- 
When h < m, it can be shown that D preserves the structure 
in (l29b . If the j-th column block Cj of F was skipped when 
generating the submatrix D, one has to skip the (m — j)-th 
row block Rm_j to avoid generating a trivial sub-matrix D. 
We refer the reader to 13 for the complete proof. 


Remark 6. The degrees of the polynomials entries of D 
satisfy (|2^ . (I24ab and (I24bb . 


Lemma 3. For 1 < Z < n, let be Z distinct 


elements of the set {0,... ,n — 1}. There exists a full-rank 
matrix M G such that the q-degrees of 


are monotonically increasing. Moreover, the q-degrees of the 
polynomial entries of the product (/|*^,/|*^,...,are 
also sorted in the same order, for all 0 < s < n(m + 1) — 1. 

Proof: As discussed in Remark 01 because a is a nor¬ 
mal element of F^m and A is full rank, the polynomials 
{fii ! /i 21 ■ • ■ I /ii) linearly independent over Fg. As a result, 
the isomorphic matrix 

■ • -AnUii)) 


is full column rank and can be transformed to reduced column 
echelon form ((?)(/* J, ..., through elementary 

column operations, i.e., 

= (</'n(/ii),</'n(/*2), ■ • ■ M, 

where M G F^^* is full rank matrix in the ground held. This 
is equivalent to saying that the degrees of the polynomials 
ifii,---, fii ) are strictly increasing. 

The second part immediately follows, as 
(/!:’,j;’) by using 
Freshman’s rule. ■ 


C. Proof of Theorem 0 

Having established the various properties in the previous 
section we are now in a position to complete the proof of 
Theorem |5] 

Proof of Theorem^ Consider any nontrivial minor D of 
F. We will show that det(D) 0. First note from Property 0] 
and 13 the elements of D satisfy ( l23l l. (I24ab and (I24bb . which 
are the counterparts of (I16ab . (I17ab and (I17bb respectively. We 
will show that D can be transformed into a matrix D that 
satishes the counterpart of (I16bb . 

Using Property 0 we have that the structure of D should 
satisfy (123 . Consider the sub-matrices Dq, ..., D/j associated 
with D. Each Dj is contained entirely within a column 
block of F. Thus it follows from (123 that any row of Dj 
can be expressed in the form ifl^\ fll\ ■ ■ ■, flf) for some 
0 < s < n(m-l-l) —1 and 0 < ii < i 2 < ■ ■ ■ < < n — 1. 

In particular the indices are common across all 

rows in Dj and only the q-degree, denoted by s, varies 
across the rows. Using Lemma [3 we construct a full-rank 
matrix Mj G Fj" \ such that the degrees of the polynomials 
of Dj = Dj • Mj will increase monotonically across the 
columns within each fixed row. Performing this for each Dj, 
we construct 

D = D • M, (30) 

where M = diag (M?i,..., Mq) is full-rank with elements 
in Fg. It follows that the transformed matrix D satishes the 
counterpart of (I16bb . 

We argue that the remaining conditions are also preserved 
under the transformation of M. Note that the elements on any 
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given row of Dj are obtained through linear combinations 
(over Fq) of the elements in the corresponding row of Dj. 
Since both (12111 and (l22l i are satisfied by all elements belonging 
to a fixed row of Dj, they must also be satisfied by Dj. 
Then the proof of (I24bl i. (l23T l and (I24al i can be carried out 
for elements in Dj. 

Consequently, D completely satisfies (fTbl l and (flTT i. in 
contrast to D which only satisfies (l23T l and (1241 . The proof 
showing that D is non-singular then follows verbatim to the 
proof showing all sub-matrices of T are non-singular. Then, 
det D = det D det M implies that D is also non-singular and 
therefore, F is super-regular. 

Recall for this proof, we fixed Aq = Ai = • • • = Am- 
Without the assumption, the product F would be comprised 
of a different set of fi{a) for each column block. Because 
each Di is a sub-matrix from a single column block and 
column operations are performed for each D^ independently, 
the polynomial degrees can always be transformed in order 
to satisfy (fTbl) and (fTTl l. Consequently, the proof is identical 
when considering a A[o „j] constructed from different blocks. 

■ 

We summarize the main steps in the proof of Theorem |3 
as follows. A comparison of our proof and the original proof 
from |j9| is provided in Fig. |2] 

1) We start by constructing the matrices Tj in (fOl l for 0 < 
j < m with a being a primitive normal over F^m . 

2) We consider the product F = TA[q j„], where T is 
the block Hankel matrix in (fTHi and A[o,m] is a block 
diagonal matrix in the ground field. 

3) We next consider non-trivial sub-matrices D of F and 
study their structure and ordering of the g-degrees (Prop¬ 
erties lll|5]|6]l. 

4) In Lemma |3] we show that the matrix D can be trans¬ 
formed into the matrix D, where the g-degrees are sorted 
as required by Lemma |2] 

5) Applying Lemma |2] it follows that D is non-singular. In 
turn this also implies that D is non-singular. 

6 ) This is equivalent to showing that F is super-regular and 
the Theorem follows. 


satisfies Theorem\^ Then, 


/ 





aW 


a[3l\ 




aW 

aPl 


aW 





aPl 

a[3l 

aW 






a[3l 

aW 





aPl 

a[3l 

aW 

a[5l 



all] 


a[3l 

aW 


a[6l 



a[2l 

a[3l 

aW 



aPl 

a[8l 

q ;[®1 

Va[3l 





a[8l 

q ;[®1 



is a super-regular matrix. 


Let Aq and Ai be the following two non-singular square 
matrices: 


/o 

1 

1 

oV 


/I 

0 

0 


1 

0 

0 

0 


1 

1 

1 

0 

1 

0 

1 

0 

; Ai = 

0 

1 

0 

1 

VO 

0 

0 

V 


lo 

1 

0 

0; 


We use A[q]^] = diag(Ao,Ai) as the block-diagonal 
matrix and generate the product F given in (EB. 

Because Aq A\, the matrix F is not a Hankel ma¬ 
trix. However, note that every block matrix posses a Moore 
structure. The elements in every row of a given block are 
Frobenius powers of the corresponding elements in the first 
row. Since a is a normal element in F 22048 and Aq and 
Ai are full rank, the isomorphism 4>n{-) in Appendix L4-AI 
implies that the entries generating the left column block 
-I- are linearly independent poly¬ 

nomials. Similarly, the entries generating the right column 
block -f -f -f -f are also 

respectively linearly independent amongst each other as stated 
in Remark^ Note that the q-degree of these polynomials are 
upper and lower bounded between 3 and 0 which satisfies (EB- 


D. Example Illustrating the Proof of Theorem 0 

In this subsection, we provide an example of the preserva¬ 
tion of super-regularity, detailing the properties of F and the 
key concepts in the proof of Theorem |5] 

Example 1. Let g = 2,n = 4, to = 1 and a be a primitive 
normal element of F 2 M, where M = g"(™+2)-i = 2048 


Furthermore, the q-degrees are sorted according to Prop¬ 
erty IB On a given column of F, the q-degrees are increasing 
by 1 from top to bottom. Moreover, for any fixed row, any 
polynomial in the left block has a lower degree than any 
polynomial in the right block. 

Now consider the sub-matrix formed from rows Tifi S 













+ 

aW 

aW 

+ 

a[2l 

+ 

a[3l 

aOl 


+ 

aPl\ 











aW 

+ 

aPl 

aPl 

+ 


+ 

aW 


aW 

+ 

a[31 











aPl 

+ 

a[31 

a[31 

+ 

aW 

+ 


a[3l 

aPl 

+ 

aW 











a[3l 

+ 

aW 

aW 

+ 


+ 

a[6l 

aW 

a[3l 

+ 


aW 

+ 

a 

[2] 



+ 

a 

[2] 

a[3l 

aW 

+ 



+ 


+ 



aW 

+ 

a[6l 


+ 

a 

[ 3 ] 

aO] 

alO 

+ 

a 

[ 3 ] 

aW 


+ 

a[6l 

a[6l 

+ 


+ 




+ 

aPl 


+ 

a 

[ 4 ] 

a[2l 


+ 

a 

[ 4 ] 



+ 

aPl 

aPl 

+ 

a[8l 

+ 




+ 

a[8l 


+ 

a 

[ 5 ] 

a[3l 

a[3l 

+ 

a 

[ 5 ] 


aPl 

+ 

a[8l 


+ 

q;[®1 

+ 

aOo] 



+ 
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Construct Tj where a is 
both primitive and normal 


F = TA is a Hankel Matrix 


Structure and Properties of a 
non-trivial sub-matrix D of F 


Construct D from D such 
that 5 -degrees are sorted 


Apply Lemma 12 to show D is non-singular 


Since D is non-singular, so 
is D and F is super-regular 

(b) MSR Proof Summary 


Fig. 2: A summary of the proof of Theorem |5] compared with the original proof given in ||9|- The highlighted sections denote 
the key differences. 


{1,2,4, 5} and columns Cj,j £ {3,4, 5,6}. This matrix 


/ 

alH + 

-f Q;[3] _(_ q,[4] 

a[2l\ 


a[2l + a[31 

a[31 -b QfW -b 

a[3l 

a[3l 

-1- a [3] 

Qfl®] -b 


VaW 

a[5l + a[6l 

Qfl®] -b + q;[®] 



does not have increasing q-degrees along rows, i.e., (fT^ is 
not satisfied. The degree of the polynomials in the second and 
fourth columns are equal and lower than the degree of the 
polynomials in the third column. The degrees can be sorted 
by subtracting the fourth column from the second and then 
swapping the third and the fourth columns. The transformed 
matrix 


/ 

aW 


a[2l+a[3l+aW\ 


aPl 

a[3l 

-b ctW -b al®! 

alol 

aW 


Qfl®] -b + oi'F'^ 

Vain 

a[3l 

a[6l 

Qfl®! -b ell'll -b Q;[®y 


completely satisfies (O and (O. The above column opera¬ 
tions are equivalent to right multiplication with the matrix. 


/l 0 0 0\ 

M= 0 1 0 0 

0 0 0 1 ’ 
yo 1 1 0/ 

which is full-rank. It follows that D is non-singular and 
therefore, D as well. 


VI. Code Construction 


The rows of T are permuted to form the block Toeplitz 
structure of an extended generator matrix 


/To 


T = 


Ti 

To 


Tm \ 

Tm-1 

To 


02 ) 


Since every sub-matrix of T has a counterpart in T identical 
up to row permutations, this block Toeplitz matrix is also 
super-regular. is then constructed as a sub-matrix of 

k{m-\-l) rows from T. This process parallels the construction 
of m-MDS generator matrices Q. 


Theorem 6. Let T be the super-regular matrix in (l32l l 
generated using a primitive normal a G F^m, where M = 
gr!,(m+ 2 )-i 0 < < • • • < Zfc < n and construct a 

k(m-hl) X n(m-l-l) sub-matrix of T from rows indexed 
jn-hii,..., jn-\-ik for 0 < j < m. This matrix is the extended 
generator of an MSR convolutional code C[n, k, m]. 

Proof: We show that Gj^^ satishes Theorem |3] Assume 
without loss of generality that ii — Q,... fik = k — 1. Each 

is divided into ’ where G^ £ are the blocks of the 

extended generator matrix. For 0 < f < to, let A* G be 

non-singular matrices. We similarly divide At = [A* AQ, 
where the two blocks A} £ and A{ G 

represent the reduced channel matrix and some remaining 
matrix respectively. Let A[q „j] = diag (Aq, ..., A^). The 
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product can be written as 


TA 


/TqAo TiAi 
TqAi 


[0,m] 


T A 

m-^m 

T 1 A 


ToAm / 


where 


T,At 


/G*A* G*A'\ 

[t'a; T[A'J- 


The sub-matrix of TA[o „j] containing only the rows 
and columns involving G^A^ is equal to the product 
GJ^^A^p Because TA[q „j] is super-regular, the determi¬ 
nant of G^^Ap is either trivially zero or non-trivial and 
therefore non-zero. 


It can be shown that for all A*p whose blocks satisfy (0, 
the product 


G^^A 


[0,r: 


/GoA5 GiAl 
GoAl 


\ 


GmA*^ \ 

Gm-1 A^ 


Goa;, 


has a non-trivial determinant. Note that the structure is re¬ 
versed from that of D in ( |29] |. In ||9l, the authors showed 
that for D to have a non-trivial determinant, the number of 
rows of each Dj block cannot be less than the number of 
columns of each Oj block. In this case each Dj block of 
GJ^^A^p contains k{j+l) rows and each Oj block contains 
St=o Pi columns. Consequently if the condition in 0 is 
satisfied, then k{j + 1) > X]t=o Pt i ^ implies that 
G^^A^p has a non-trivial determinant and is therefore 
non-singular. Thus, Gj^^ satisfies Theorem |3] and C[n, fc, m] 
achieves d_R(m) = (n — fc)(TO -|- 1) -f 1. ■ 


Remark 7. Our construction thus far is feasible over any 
sliding window channel C'H{S, W) with delay T = W — 1 
and S < dii{W — 1). If instead T >W holds, the same code 
C[n,k,T] is still feasible with the same threshold on S i.e., 
S < dii{W — 1). However if T < W — 1, then the code is 
feasible provided that ^ < dn{T). 


A. Numerical Results 

The bound on the field size given in Theorem |5] is only 
a sufficiency constraint required for the proof. For small 
code parameters, it is possible to numerically verify whether 
Theorem [2 holds for a given extended generator matrix. An 
example is provided below of an MSR code over a small 
field. For the results in this section, we represent the primitive 
normal elements in the field as elements within the polynomial 
ring. 

Example 2. Let a = X \ be a primitive normal element 
in F 211 = F 2 /(A^^ -f -f 1). We construct the following 


extended generator matrix 


GEX 


/ aW 


a[6l 

aPl\ 

aP^ 


aPl 

a[8l 



aPl 

a[31 

V aW 


a[31 

awy 


This matrix satisfies Theorem\^ making it the generator for an 
MSR code C[4, 2,1]. Theorem\^guarantees the construction if 
M = 2^^, i.e., a is a primitive normal element 0 /F 22048 . 


Example 3. Let a = X + 1 be a primitive normal element in 
F 211 = F 2 / {X^^ -f X^ -f 1). The extended generator G 2 ^ is 
given by 


/ a 

aP^ aP^ 

aW 


a[6l 

aPl 

a[8l\ 


aW aP^ 

a[6l 


a[8l 


a[io] 


a 

aW 


a[31 

aW 

a[5l 


aPl 

a[3l 



a[6l 






a 

aW 

a[2l 

V 





a[3l 

aW / 


This matrix satisfies Theorem\^ making it the generator for an 
MSR code C[3, 2,2]. Theorem^guarantees the construction if 
M = 2^^, i.e., a is a primitive normal element 0 /F 22048 . 

The construction in Theorem |6] can generate MSR codes 
over much smaller field sizes than than those given by the 
bound of M > g"("*+2)-i Table |T] provides a list of code 
parameters and the field sizes on which they satisfy the matrix 
multiplication property of Theorem [2 


VII. Conclusion 

We introduce a new distance metric for convolutional codes 
called the column sum rank distance. We prove several prop¬ 
erties analogous to the column Hamming distance. A new 
family of codes — MSR Codes — that achieve the maximum 
distance up to the code memory is proposed. This is the 
rank metric counterpart to the m-MDS convolutional code. 
Our construction is based on matrices over extension fields 
that preserve super-regularity after multiplication with block 
diagonal matrices in the ground field. 

The proof requires large field sizes but we numerically 
show that MSR codes do exist over smaller fields. Future 
work involves pursuing a more detailed study on field size 
requirements. Moreover, we have only considered a specific 
class of rank-deficient sliding window channels. In single¬ 
link streaming over burst erasure or mixed erasure channels, 
structured constructions using m-MDS codes as constituents 
have been revealed as more powerful alternatives i). A similar 
study pertaining MSR codes remains an interesting direction 
of further study. 


Appendix A 

Mathematical Preliminaries 
A. Finite Fields 

For M > 0 and a prime power q, let F^ be the finite 
field with q elements and F^m be an extension field of F^. A 
primitive element a G F^m is one whose consecutive powers 
can generate all non-zero elements of that field, i.e., F^m = 
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Parameters [n, fc,m] 

Field Bound 

Achievable Field 

a 

[4,2,1] 

F22O48 

F 211 =F 2 /(V^"+X" + 1) 

x + \ 

[3,2,2] 

F22O48 

F 211 =F2/(V“+X2 + 1) 

X + l 

[3,1,2] 

F22O48 

F 211 =F2/(V11+X2 + 1) 

X + l 

[2,1,2] 

F 212 S 

F 27 = F2/(x7 + a:3 + i) 

X^ + 1 

[2,1,1] 

F 264 

F 25 =F2/(X5+V2 + 1) 

V + 1 


TABLE I: Achievable field sizes under which codes constructed using Theorem |6l The bound required by the theorem for 
each set of code parameters is provided in the middle column. 


{0, a, ..., Let Fq[A'] be a polynomial ring of the 

ground field. The minimal polynomial of a primitive element 
a is the lowest degree monic polynomial Pa{X) G Fq[X] for 
which a is a root. The minimal polynomial is irreducible and 
the degree of Pa(X) is equal to M. 

Lemma 4. If f{a) = 0 for any f{X) G Fq[X], then PaiX) \ 
fiX). 

Proof: The proof can be found in ESI Chapter 4]. ■ 

The extension field F^m is isomorphic to the M- 
dimensional vector space F^ over the ground field. Let 
ag,... ,aM-i C FqM map to a basis for the vector space. 
A basis is defined as being normal when for 0 < i < m — 1, 
each ai = for some a G F^m . The generating element a 
is referred to as a normal element. The notation = a'^ 
is used to describe the i-th Frobenius power of a. Every 
element / G F^m can be written as a linear combination in 
Fq of the basis elements. LFsing the normal basis, / resembles 
a linearized polynomial f(X) = G Fq[X] 

evaluated at the normal element X = a. The coefficients of 
this polynomial can be mapped 

M-l 

f{oi) = Y. /.aW ^ f = (/o,..., (33) 

i=0 

to the entries of a unique vector f G F^^^. This mapping 
can be extended to vector spaces over the extension field and 
matrix spaces over the ground field. Using (l3Tt . we define 
: F^m —>■ as a bijection transforming a vector of 

linearized polynomial entries to a matrix whose columns are 
the coefficients of the polynomials. 

For every finite extension of a finite field, there exists at 
least one element that is both a normal and a primitive element 
Wl- Such an element is referi'ed to as being primitive normal. 
The properties of both normal and primitive elements are 
inherited in a primitive normal and will be useful in our code 
construction. 

A linearized polynomial is defined by the property that every 
monomial term must have a Frobenius power. A linearized 
polynomial possesses a g-degree, denoted as degq/(Ar), 
which gives the largest Frobenius power of the polynomial. 

B. Super-regular Matrices 

For 6 S N, let u be a permutation of the set {0,..., 6 — 1}. 
A permutation is comprised of a series of transpositions, which 
are defined as two entries of the set switching positions. The 
sign function of a permutation measures its parity, i.e., sgn (cr) 


is equal to 1 when a is constructed from an even number of 
transpositions, and equal to —1 otherwise. Let St, denote the 
set of all possible permutations. The determinant of a 6 x & 
matrix D can be calculated by summing over all permutations 
in Sb in the Leibniz formula 

b-l 

det D = Y sgn (o') H • (^4) 

(7^Sh 2 — 0 

Each product ni=o is referi'ed to as a term in the 

summation. When every term is equal to 0, the matrix is said 
to have a trivial determinant. Using the Leibniz formula, a 
super-regular matrix is a matrix for which every square sub¬ 
matrix with a non-trivial determinant is non-singular. 

References 

[1] R. Mahmood, A. Badr, and A. Khisti, “Convolutional codes in rank 
metric for network streaming,” in IEEE Int. Symp. Inf. Theory (ISIT), 
2015, to appear in 2015. 

[2] E. Martinian and C.-E. W. Sundberg, “Burst erasure correction codes 
with low decoding delay,” IEEE Trans. Inf. Theory, vol. 50, no. 10, pp. 
2494-2502, 2004. 

[3] D. Leong, A. Qureshi, and T. Ho, “On coding for real-time streaming 
under packet erasures,” in IEEE Int. Symp. Inf Theory (ISIT), 2013, pp. 
1012-1016. 

[4] A. Badr, R Patil, A. Khisti, W. Tan, and J. Apostolopoulos, “Layered 
constructions for low-delay streaming codes,” IEEE Trans. Inf. Theory, 
to appear in 2015. 

[5] V. Tomas, J. Rosenthal, and R. Smarandache, “Decoding of convolu¬ 
tional codes over the erasure channel,” IEEE Trans. Inf. Theory, vol. 58, 
no. 1, pp. 90-108, 2012. 

[6] M. Ellis, D. P. Pezai'os, and C. Perkins, “Performance analysis of AL- 
FEC for RTP-based streaming video traffic to residential users,” in IEEE 
Packet Video Workshop, 2012. 

[7] H. Gluesing-Luerssen, J. Rosenthal, and R. Smarandache, “Strongly- 
MDS convolutional codes,” IEEE Trans. Inf. Theory, vol. 52, no. 2, pp. 
58^598, 2006. 

[8] R. Hutchinson, R. Smarandache, and J. Trumpf, “Superregular matrices 
and the construction of convolutional codes having a maximum distance 
profile,” Lin. Algebra and App., vol. 428, pp. 2585-2596, 2008. 

[9] P. Almeida, D. Napp, and R. Pinto, “A new class of super regular 
matrices and MDP convolutional codes,” Lin. Algebra and App., vol. 
439, pp. 2145-2157, 2013. 

[10] R. Ahlswede, N. Cai, S. Li, and R. W. Yeung, “Network information 
flow,” IEEE Trans. Inf. Theory, vol. 46, no. 4, pp. 1204-1216, 2000. 

[11] P. A. Chou, Y. Wu, and K. Jain, “Practical network coding,” in AUerton 
Conf. on Comm., Control, Comp., 2003. 

[12] T. Ho, M. Medard, R. Kotter, D. R. Karger, M. Effros, J. Shi, and 
B. Leong, “A random linear network coding approach to multicast,” 
IEEE Trans. Inf. Theory, vol. 52, no. 10, pp. 413-^30, 2006. 

[13] E. M. Gabidulin, “Theory of codes with maximum rank distance,” Probl. 
Inf. Transm., vol. 21, no. 1, pp. 1-12, 1985. 

[14] R. M. Roth, “Maximum-rank array codes and their application to 
crisscross error correction,” IEEE Trans. Inf. Theory, vol. 37, no. 2, 
pp. 328-336, 1991. 

[15] R. Kotter and F. R. Kschischang, “Coding for errors and erasures in 
random network coding,” IEEE Trans. Inf. Theory, vol. 54, no. 8, pp. 
3579-3591, 2008. 







14 


[16] D. Silva, F. R. Kschischang, and R. Kotter, “A rank-metric approach 
to error control in random network coding,” IEEE Trans. Inf. Theory, 
vol. 54, no. 9, pp. 3951-3967, 2008. 

[17] R. W. Nobrega and B. F. Uchoa-Filho, “Multishot codes for network 
coding using rank-metric codes,” in IEEE Wireless Netw. Coding Conf. 
(WiNC), 2010, pp. 1-6. 

[18] A. Wachter-Zeh, M. Stinner, and V. Sidorenko, “Convolutional codes in 
rank metric with application to random network coding,” IEEE Trans. 
Inf. Theory, vol. 61, no. 6, pp. 3199-3213, 2015. 

[19] A. Khisti, D. Silva, and F. R. Kschischang, “Secure-broadcast codes over 
linear-deterministic channels,” in IEEE Int. Symp. Inf. Theory (ISIT), 
2010, pp. 555-559. 

[20] S. Jalali, M. Effros, and T. Ho, “On the impact of a single edge on the 
network coding capacity,” in Inf. Theory and App. (ITA), 2011, pp. 1-5. 

[21] A. Badr, D. Lui, and A. Khisti, “Streaming codes for multicast over 
burst erasure channels,” IEEE Trans. Inf. Theory, vol. 61, no. 8, pp. 
4181^208, 2015. 

[22] R. Johannesson and K. S. Zigangirov, Fundamentals of Convolutional 
Coding. IEEE Press, 1999. 

[23] S. Lin and D. J. Costello, Error Control Coding. Pearson Education 
Inc., 2004. 

[24] E. M. Gabidulin, “Convolutional codes over large alphabets,” in Proc. 
Int. Workshop on Algebraic Combinatorial and Coding Theory (ACCT), 
Varna, Bulgaria, 1988, pp. 80-84. 

[25] R. Mahmood, “Rank metric convolutional codes with applications in 
network streaming,” Master’s thesis. University of Toronto, 2015. 

[26] F. J. MacWilliams and N. J. A. Sloane, Theory of Error-Correcting 
Codes. North Holland Mathematical Library, 1977. 

[27] H. W. Lenstra and R. J. Schoof, “Primitive normal bases for finite fields,” 
Math. Comp., vol. 48, no. 177, pp. 217-231, 1987. 


Rafid Mahmood Rafid Mahmood received his B.A.Sc. and M.A.Sc. degrees 
in Electrical and Computer Engineering from the University of Toronto in 
Toronto, Ontario, Canada in 2013 and 2015 respectively. During this time, he 
foucsed his research on coding theory and information theory. He is pursuing 
his Ph.D. degree in Mechanical & Industrial Engineering at the University of 
Toronto. 


Ahmed Badr Ahmed Badr received the B.Sc., M.Sc. and Ph.D. degrees in 
Electrical & Computer Engineering from Cairo University, Egypt in 2007, 
Nile University, Egypt in 2009 and University of Toronto, Canada in 2014. 
From September 2007 to August 2009, he was a Research Assistant in the 
Wireless Intelligent Networks Center (WINC), Nile University. In September 
2009, he was a Research Assistant at the Signals Multimedia and Security 
Laboratory in University of Toronto. In 2014, he assumed his current position 
as a Postdoctoral Fellow in University of Toronto. His research interests 
include information theory, coding theory and real-time communication. 


Ashish Khisti Ashish Khisti is an assistant professor and a Canada Research 
Chair in the Electrical and Computer Engineering (ECE) department at the 
University of Toronto, Toronto, Ontario Canada. He received his BASc degree 
in Engineering Sciences from University of Toronto and his S.M and Ph.D. 
Degrees from the Massachusetts Institute of Technology (MIT), Cambridge, 
MA, USA. His research interests span the areas of information theory, wireless 
physical layer security and streaming in multimedia communication systems. 
At the University of Toronto, he heads the signals, multimedia and security 
laboratory. He is a recipient of the Ontario Early Researcher Award from the 
Province of Ontario as well as a Hewlett-Packard Innovation Research Award. 



